# MIPS based MCU Architecture

**Final Project Assignment Definition** 

Hanan Ribo

17.07.24

## Table of contents

| 1.  | Aim of the project                                        | 3  |
|-----|-----------------------------------------------------------|----|
| 2.  | Definition and prior knowledge                            |    |
| 3.  | Assignment definition                                     |    |
| I/  | /O devices connected:                                     | 4  |
| 4.  | Required Support of CPU Peripherals                       | 6  |
| 5.  | Interrupt Service BUS Protocol of a Single Cycle CPU:     |    |
| 6.  | Pin Planner                                               | 14 |
| 7.  | Host Interface (to ITCM and DTCM)                         | 14 |
| 8.  | Compiler, Simulator and Memory                            | 14 |
| 9.  | CPU and MCU Test                                          | 14 |
| 10. | MCU and PC communication using RS-232 interface (= Bonus) | 15 |
| 11. | Requirements                                              | 15 |
| 12. | <del>-</del>                                              |    |
| 13. | References                                                | 17 |

## 1. Aim of the project

- Design, synthesis and analysis of a simple (single cycle architecture) MIPS CPU core with Memory Mapped I/O, interrupt capability and Serial communication peripheral (as bonus)
- Understanding of CPU vs. MCU concepts
- Understanding in FPGA memory structure

## 2. Definition and prior knowledge

The aim of this project is to design CPU MIPS based MCU. The CPU will use a Single Cycle MIPS architecture and must be capable of performing full instruction set of simple MIPS (given as an appendix). The design will be located on Altera Board. The MIPS architecture is Harvard architecture in order to increase throughput and simplify the logic. For additional information regarding MIPS CPU, Architecture, ISA and instructions see MIPS technical documentation [1].

## 3. Assignment definition

The architecture must include a MIPS ISA compatible CPU with data memory DTCM and program memory ITCM for hosting the program data and code segments. The block diagram of an architecture is given in Figure 1. The CPU will have a standard MIPS register file. The top level and the MIPS core must be structural. The design must be compiled and loaded to the Altera board for testing. A single clock (CLK) should be used in the design.

Note: use push-button KEY0 as a System RESET (brings the PC to the first program instruction)

MIPS CPU **Program Memory BUS** Data Memory BUS Control Control MIPS Address Address core Data Data I/O interface I/O interface I/O interface Program Data Clk Chache Chache Memory Memory Device 1 Device 2 Device n Device interface registers Clock = 50MHzClk

3

• The GPIO (General Purpose I/O) is a simple decoder with buffer registers mapped to data address (Higher than data memory) as given in the assembly code examples that enables the CPU to output data to GPIO devices as LEDs and 7-Segment and to read the Switches array value.



The Data Address Space is 32-bit WORD aligned where the physical address space it is the lowest 12-bit  $0 \dots 0A_{11} \dots A_0$  with partial mapping.

Figure 2: Data Address Space contains Data Memory and Memory Mapped I/O

#### I/O devices connected:

In the hardware test case, you will have to test an **ALU digital system** onto D10-Standard FPGA board.

- Board ten switches (SW9-SW0) and push four debounced pushbuttons (KEY3-KEY0) will be used as Input interface.
- Board 10 red LEDs (LEDR9-LEDR0) and six 7-segment displays (HEX5-HEX0) used as Output interface.
- Connections between the 2x20 GPIO Expansion Header and Cyclone V SoC FPGA





Figure 3: Primitive GPIO peripheral connection using Memory Mapped I/O approach

• The CPU will be based on the *standard 32bit MIPS ISA* and the Instructions will be 32 bit wide. The following table shows the MIPS instruction format. For more information, see MIPS technical documents [1].

| Type | -31-       | format (bits) |        |        |           | -0-       |
|------|------------|---------------|--------|--------|-----------|-----------|
| R    | opcode (6) | rs (5)        | rt (5) | rd (5) | shamt (5) | funct (6) |
| I    | opcode (6) | rs (5)        | rt (5) | immedi | ate (16)  |           |
| J    | opcode (6) | address (26)  |        |        |           |           |

**Table 1: MIPS Instruction format** 

The Data address space is 4kB. Memory latency will be according to Table 2

| Memory                          | Write Latency | Read Latency |
|---------------------------------|---------------|--------------|
| Program Memory (I-Cache / ITCM) | 1 clk         | 1 clk        |
| Data Memory (D-Cache / DTCM)    | 1 clk         | 1 clk        |

**Table 2 : Memory size and latency** 

## 4. Required Support of CPU Peripherals

- i. Seven GPIO ports (six Output and one Input) for peripherals depicted in page 5
- ii. KEY [3-1]: support array of three pushbuttons as input device
- iii. Basic Timer with output compare capability:



#### BTCTL, Basic Timer Control Register



#### **BTCNT**, Basic Timer Counter



#### BTCCRx, Basic Timer Compare Register x



Basic Timer Output compare registers x={0,1} Compare data is written to each BTCCRx and automatically transferred to BTCLx. BTCLx holds the data for the comparison to the timer value in the Basic Timer Register, BTCNT.

Note: the register value is zero on RESET.



#### iv. Unsigned Binary Division Multicycle Accelerator:



The divider results are ready after N DIVCLK cycles after loading a value to the second operand DIVISOR, i.e., 32 DIVCLK cycles in our case of N=32.

#### v. <u>USART Peripheral Interface</u>, <u>UART Mode</u> (<u>Bonus 20%</u>):

The required communication peripheral is the universal **USART** (synchronous/asynchronous receive/transmit) peripheral interface in **UART** Mode only (degenerated **USART**).

You are given VHDL design code that need to be adapted to the next UART mode features.

#### *UART mode features include:*

- 1-start bit, 1-stop-bit, 8-bit data with non-parity
- Independent transmit and receive shift registers
- Separate transmit and receive buffer registers
- LSB-first data transmit and receive
- Programmable baud rate support
- Status flags for error detection
- Independent interrupt capability for receive and transmit

#### UCTL, USART Control Register

|                                                                                                 | 7       | 6     |                                                                                  | 5                     | 4             | 3                                    | 2   | 1             | 0          |
|-------------------------------------------------------------------------------------------------|---------|-------|----------------------------------------------------------------------------------|-----------------------|---------------|--------------------------------------|-----|---------------|------------|
|                                                                                                 | BUSY    | OE    |                                                                                  | PE                    | FE            | BAUDRATE                             | PEV | PENA          | SWRST      |
|                                                                                                 | r       | r     |                                                                                  | r :                   | r :           | rw                                   | rw  | rw-           | rw         |
| ,                                                                                               | SWRST   | Bit 0 | Sof<br>0<br>1                                                                    |                       | I. USART re:  | set released<br>ic held in res       |     | า             |            |
| PENA Bit 1 Parity enable  0 Parity disabled 1 Parity enabled. Parity bit is generated (TXD) and |         |       |                                                                                  | and expecte           | d (RXD).      |                                      |     |               |            |
| F                                                                                               | PEV     | Bit 2 | Parity select. PEV is not used when parity is disabled.  Odd parity  Even parity |                       |               |                                      |     |               |            |
| Е                                                                                               | AUDRATE | Bit 3 | Baud Rate value 0 9600 1 115200                                                  |                       |               |                                      |     |               |            |
| F                                                                                               | E       | Bit 4 | Framing error flag  O No error  Character received with low stop bit             |                       |               |                                      |     |               |            |
| P                                                                                               | E       | Bit 5 | •                                                                                |                       |               |                                      |     |               |            |
| C                                                                                               | ÞΕ      | Bit 6 |                                                                                  | RXBUF before No error |               | is set when a<br>ous characte        |     | s transferred | d into     |
| E                                                                                               | BUSY    | Bit 7 | This<br>0<br>1                                                                   | UART mo               | odule inactiv | mit or receiv<br>e<br>nitting or rec | •   | is in progre  | ss (busy). |

#### RXBUF, USART Receive Buffer Register

| 7                     | 6                     | 5                     | 4  | 3                     | 2  | 1  | 0          |
|-----------------------|-----------------------|-----------------------|----|-----------------------|----|----|------------|
| <b>2</b> <sup>7</sup> | <b>2</b> <sup>6</sup> | <b>2</b> <sup>5</sup> | 24 | <b>2</b> <sup>3</sup> | 22 | 21 | <b>2</b> 0 |
|                       |                       |                       |    |                       |    |    |            |

RXBUFx Bits 7-0

The receive-data buffer is user accessible and contains the last received character from the receive shift register. Reading RXBUF resets the receive-error bits, and RXIFG.

#### TXBUF, USART Transmit Buffer Register

| 7                     | 6                     | 5                     | 4  | 3                     | 2  | 1  | 0  |   |
|-----------------------|-----------------------|-----------------------|----|-----------------------|----|----|----|---|
| <b>2</b> <sup>7</sup> | <b>2</b> <sup>6</sup> | <b>2</b> <sup>5</sup> | 24 | <b>2</b> <sup>3</sup> | 22 | 21 | 20 |   |
| rw                    | rw                    | rw                    | rw | rw                    | rw | rw | rw | • |

TXBUFx Bits 7-0

The transmit data buffer is user accessible and holds the data waiting to be moved into the transmit shift register and transmitted on TXD. Writing to the transmit data buffer clears TXIFG.

**Note:** for UART module reference, see block diagram of MCUs that use acquainted with.

#### vi. Interrupt controller:



#### Notes:

- The BTIFG, DIVIFG flags are reset automatically when the interrupt is serviced.
- ii. **RXIFG** is automatically reset if the pending interrupt is served or when **RXBUF** is read.
- iii. TXIFG is automatically reset if the interrupt request is serviced or if a character is written to TXBUF
- iv. The **KEYIIFG** is reset manually with software (**BTIFG**, **DIVIFG**, **RXIFG**, **TXIFG** as well).
- v. As part of CPU services an interrupt, GIE is clear (in HW) means DINT of other interrupts. Symmetrically, as part of CPU returning from interrupt, GIE is set (in HW) means EINT of interrupts (back the origin state).

#### **Handling an interrupt:**



#### **Handling interrupts from several sources:**



#### IE, Interrupt Enable Register



IEx Bit x 0 Interrupt not enabled 1 Interrupt enabled

#### IFG, Interrupt Flag Register



IFGx Bit x 0 No interrupt pending 1 Interrupt pending

#### TYPE, Interrupt Type Register

| 7   | 6   | 5     | 4 | 3 | 2  | 1 | 0   |
|-----|-----|-------|---|---|----|---|-----|
| 0   | 0   | TYPEx |   |   |    | 0 |     |
| r-0 | r-0 | r-0   | n | n | r. | r | r-0 |

| TYPE Contents | Interrupt Source  | Interrupt Flag | Interrupt<br>Priority | •                        |
|---------------|-------------------|----------------|-----------------------|--------------------------|
| 00h           | RESET             | NMI            | Highest               | (Non)-Maskable Interrupt |
| 04h           | UART status error | -              |                       | ٦                        |
| 08h           | UART RX           | RXIFG          |                       |                          |
| 0Ch           | UART TX           | TXIFG          |                       |                          |
| 10h           | Basic Timer       | BTIFG          |                       |                          |
| 14h           | KEY1              | KEY1IFG        |                       | Maskable Interrupt       |
| 18h           | KEY2              | KEY2IFG        |                       |                          |
| 1Ch           | KEY3              | KEY3IFG        |                       |                          |
| 20h           | DIVIDER           | DIVIFG         | Lowest                | J                        |



## 5. Interrupt Service BUS Protocol of a Single Cycle CPU:

#### 1. CPU services an interrupt request (latency of two or three cycles):

This ongoing event is triggered on falling edge of an INTA signal (the ensuing cycle after INTR is set to '1')

- *i.* GIE=0 (bit k0[0] = 0)
- ii. Writing content of register TYPE on Data BUS

<u>Note:</u> cannot be written on Address BUS because CPU is the only BUS master (executes this protocol).

- iii. Set INTA (INTA='1')
- *iv.* Serial emulation execution of *load* (of TYPE content) and *jal* (to Mem [TYPE] content) where \$k1=PC+4

#### 2. <u>CPU returning from service of an interrupt request</u> (latency of one cycle):

This event happens as a part of *reti* (jr \$k1) execution GIE=1 (bit \$k0[0] = 1) *and* go to return address stored in \$k1

#### 6. Pin Planner

Only MCU IO devices need to be connected to FPGA location legs via pin planner. Location legs that used for proof of work phase (signal tap) need to be removed at the final step.

## 7. Host Interface (to ITCM and DTCM)

After the last system developing stage, you will be given a made JTAG based code wrapper of communication interface to L1 memory caches (data and program) in order to upload/download their content without reloading the system hardware design onto the FPGA chip.

## 8. Compiler, Simulator and Memory

The MARS compiler and simulator, or any other can be used to compile and simulate the assembly code. MARS compiler can also export the memory contents into the file in format that VHDL can easily read. It can also simulate a cache performance.

The mars compiler, installation instructions and documentation are available at: http://courses.missouristate.edu/KenVollmar/MARS/

#### 9. CPU and MCU Test

- a) Mandatory: supporting all of given test<sub>i</sub>.asm assembly source files.
- b) Bonus (= under condition of working properly): Using serial communication support application of PC side (as Hyper-Terminal, Tera-Term, puTTY etc) and write code for MCU side that support the next menu (transmitted from MCU to PC):

#### Menu

- 1. Count up from 0x00 onto LEDG with delay ~0.5sec
- 2. Count down from 0xFF onto LEDR with delay ~0.5sec
- 3. Clear all LEDs
- 4. On each KEY1 pressed, send the massage "I love my Negev"
- 5. Show Menu

## 10. MCU and PC communication using RS-232 interface (= Bonus)

The FTDI driver emulates a standard PC serial port such that the USB device may be communicated with as a standard RS-232 device. The driver allows direct access to a USB device via a DLL interface.



## 11. Requirements

You have to do the following tasks:

- ModelSim Simulation with maximal coverage.
- Analyze the critical path, explain where it is in your VHDL design and find the maximal operating clock.
- Load the design onto the FPGA and verify the simulation results.
- Run the required assembly source codes and explore them.

The following must be presented in **final.pdf** report file.

- 1. Top level block review diagram of your design.
- 2. For each block in the top level design:
  - RTL Viewer results
  - Logic usage for each block (Combinational and Flip-Flops).
  - Graphical description (a square with ports going in and out).
  - Port Table (direction, size, functionality).
  - Short description.

- 3. Maximum (Critical) path of your design explain where it is in the code and how it is possible to optimize if you would have more time. What is the maximum clock frequency?
- 4. Minimum path analysis.
- 5. Documentation Style Content with page numbers, Images and tables will be numbered. The caption of an images and tables below the images or tables.
- 6. Elaborated analysis and wave forms:
  - Maximal Frequency and critical paths from Timing Analyzer.
  - Proof of work using Signal Tap shot screens.
     Recall that, proof of work using Signal Tap is mandatory.
  - <u>One</u> basic waveform to explain the system timing.

#### Design requirements:

- 1. The design must be well commented.
- 2. The system must work from only one clock.
- 3. System RESET (KEY0) must be synchronous.
- 4. Conclusions
- 5. A ZIP file in the form of **id1\_id2.zip** (where id1 and id2 are the identification number of the submitters, and id1 < id2) *must be upload to Moodle only by student with id1* (any of these rules violation disqualifies the task submission).
- 6. The **ZIP** file will contain the next six subdirectories (*only the exact next sub folders*):

| Directory | Contains                                      | Comments                                     |
|-----------|-----------------------------------------------|----------------------------------------------|
| DUT       | Project VHDL / Verilog HDL files (you must    | Only VHDL / Verilog HDL files, excluding     |
|           | use only a single version of DUT files which  | test bench Note: your project files must be  |
|           | are adapted to ModelSim and Quartus IDEs      | well compiled (in ModelSim and Quartus       |
|           | under method of conditional compilation using | separately) without errors as a basic        |
|           | generic map of Boolean parameter)             | condition before submission                  |
| TB        | VHDL files that are used for test bench       | Only one <b>tb.vhd</b> for the overall DUT   |
| SIM       | ModelSim DO files                             | Only for <b>tb.vhd</b> of the overall DUT    |
| DOC       | Project documentation                         | Readme.txt and final.pdf full report file    |
| Quartus   | Signal Tap files used in project verification | Do not place files that are not relevant for |
|           | Project SOF file                              | compilation or is a result of compilation!   |
|           | Project SDC file                              |                                              |
| CODE      | The assembly source code of clause 6b         |                                              |

**Table 3: Directory Structure** 

## 12. Grading policy

| Weight | Task              | Description                                                     |
|--------|-------------------|-----------------------------------------------------------------|
| 10%    | Full              | The "clear" way in which you presented the requirements and the |
|        | Documentation     | analysis and conclusions on the work you've done                |
| 90%    | System Execution, | The correct analysis of the system (under the requirements)     |
|        | Analysis and Test |                                                                 |

**Table 4: Grading** 

## Late submissions will be not gotten.

#### 13. References

- [1]. MIPS32® Architecture for Programmers Volume I to III (from Moodle under Final Project)
- [2]. ALTPLL User Guide: <a href="http://www.altera.com/literature/ug/ug\_altpll.pdf">http://www.altera.com/literature/ug/ug\_altpll.pdf</a>
- [3]. Altera RAM user guide: <a href="http://www.altera.com/literature/ug/ug\_ram\_rom.pdf">http://www.altera.com/literature/ug/ug\_ram\_rom.pdf</a>
- [4]. Altera MegaFunction User Guide:

www.altera.com/literature/ug/ug\_intro\_to\_megafunctions.pdf

[5]. Bin2Hex utility

32bit - <a href="http://www.keil.com/download/docs/113.asp">http://www.keil.com/download/docs/113.asp</a>

64bit - http://www.ht-lab.com/freeutils/bin2hex/bin2hex.html